DOCUMENT RESUME 



ED 291 793 



TM 011 086



AUTHOR        Hough, Leaetta M.
TITLE         Overcoming Objections to the Use of Temperament
              Variables in Selection.
INSTITUTION   Army Research Inst. for the Behavioral and Social
              Sciences, Alexandria, Va.
SPONS AGENCY  Department of the Army, Washington, D.C.
PUB DATE      87
CONTRACT      MDA-903-82-C-0531
NOTE          32p.; Paper presented at the Annual Meeting of the
              American Psychological Association (95th, New York,
              NY, August 28-September 1, 1987).
PUB TYPE      Speeches/Conference Papers (150) -- Reports -
              Research/Technical (143)

EDRS PRICE    MF01/PC02 Plus Postage.
DESCRIPTORS   *Job Performance; Military Personnel; *Personality
              Measures; *Personnel Selection; *Predictive
              Measurement; Predictive Validity; Psychometrics;
              *Test Construction; Test Validity

ABSTRACT 

Much of the scientific community has believed that
temperament variables could not be included in batteries of tests to 
predict job performance because no generalized principles could be
discerned from the results. For Project A, a major Army project on 
the prediction of job performance, a temperament inventory was 
developed and implemented. This inventory overcomes objections 
through a carefully chosen research strategy involving: (1) a 
literature review to study predictor constructs; (2) an inventory of 
non-sensitive items and scales designed to detect intentional 
distortion of self-description; (3) a criterion-related validity 
study of the job-related nature of the temperament scales; and (4) 
examination of the effects of motivational sets on scale score and 
criterion-related validities. Through these stringent requirements, 
objections to the use of temperament variables were overcome for 
Project A. Tables of constructed scales are included. (SLD) 



***********************************************************************

* Reproductions supplied by EDRS are the best that can be made * 

* from the original document. * 
*********************************************************************** 






Overcoming Objections to the Use of 
Temperament Variables in Selection 



Presented by Leaetta M. Hough 
Personnel Decisions Research Institute 



Symposium: New Perspectives on Personality and 
Job Performance (Chair, R. C. Page)



95th Annual American Psychological Association Convention 
August 1987, New York City 






© 1987, Leaetta Hough 



"PERMISSION TO REPRODUCE THIS 
MATERIAL HAS BEEN GRANTED BY 



TO THE EDUCATIONAL RESOURCES
INFORMATION CENTER (ERIC) " 



A major part of this research was funded by the Army Research Institute,
Contract No. MDA-903-82-C-0531. All statements expressed in this paper
are those of the author and do not necessarily express the official
opinions of the U.S. Army Research Institute or the Department of the
Army.






Overcoming Objections to the Use of 
Temperament Variables In Selection 



In 1982, I was assigned responsibility for developing temperament, 
biodata, and interest measures for Project A, a major research project 
funded by the Army Research Institute to improve prediction of job 
performance of Army enlisted personnel. 

When we started, much of the scientific community believed it would 
be a waste of time to include temperament variables in a selection 
battery. There were at least five sources of negative opinion. First, 
in 1966 Guion and Gottier published an article in Personnel Psychology
that affected the scientific community's attitude and knowledge about 
the usefulness of temperament variables for predicting job performance 
criteria. They reviewed the criterion-related validities of temperament 
variables and concluded that, though temperament variables have
criterion-related validity more often than can be expected by chance, no 
generalized principles could be discerned from the results. 

A second source of negative opinion about temperament variables 
came in the form of a theoretical challenge. In 1968, Walter Mischel
published his highly influential book that caused an intense examination 
of and debate over trait conceptions. Mischel asserted that the appar- 
ent evidence of cross-situational consistency of behavior was a function 
of the use of self report as the measurement approach, that traits were 
an illusion. He proposed "situationism," stating that behavior is 
explained more by differences in situations than differences in people. 

Thus, in 1982 much of the scientific community was persuaded by the 
published literature and believed that temperament measures had little
theoretical merit and were of little practical use. Even those who
thought temperament measures might have some merit were concerned that
temperament scales might be inappropriate and unfair to people who were
protected under the 1964 Civil Rights Act. In addition, many people 
worried about intentional distortion of self descriptions in an appli- 
cant setting. 

Equally important and negative was the lay community's perception 
of temperament inventories. People objected to offensive items and 
resented being asked to respond to such items. Researchers had been 
sensitized by the lay community's negative reaction to temperament 
inventories and were legitimately leery of antagonizing the public. 

This was the environment in 1982. 

Now, in 1987, Army generals are asking us to implement the tempera- 
ment inventory we developed. What did we do to bring this about? 

RESEARCH STRATEGY 
A lot of time and effort was required. We also had a research 
strategy. That strategy is outlined on page two of your handout. I'd 
like to describe that approach and some of our findings. The research 
strategy was construct oriented and included four basic steps: (1) a 
literature review to identify predictor constructs that were likely to
predict job performance criteria important to the Army, (2) the develop- 
ment of a temperament inventory that consisted of nonsensitive items and 
scales designed to detect intentional distortion of self descriptions, 
(3) a criterion-related validity study to identify temperament scales
that were job-related, and (4) an examination of the effects of motiva- 
tional sets on scale scores and criterion-related validities. 



Literature Review 

Predictor and criterion taxonomies. Since our approach was
construct oriented for both predictors and criteria, we needed a taxon- 
omy for both predictors and criteria. The criterion categories were 
education, training, job involvement, job proficiency, and adjustment. 
For the predictors, we started with the structure initially found by 
Tupes and Christal (1961) in the early 60s. Following Hogan's thinking 
in the early 80s, we split one of the constructs into two. Thus, our 
predictor taxonomy consisted of six constructs: Surgency, Affiliation, 
Adjustment, Agreeableness, Dependability, and Intellectance.

Categorization of temperament scales. Once we had a predictor
taxonomy, our next step was to categorize existing temperament scales 
into the classification scheme. From articles and manuals, we obtained 
hundreds of correlations between temperament scales. We categorized the 
temperament scales into the six categories and a miscellaneous cate- 
gory, and then refined the classifications through an iterative process 
of classifying and reclassifying temperament scales to maximize the mean 
within-category correlations and minimize the mean between-category 
correlations. The results of this process are shown in Table 1 of your 
handout. The circles in the diagonal show the mean within-category 
correlations which are in the .30s and .40s and are, in all cases, 
higher than the mean between-category correlations. 
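
The quantity being compared at each pass of that classification process can be sketched as follows. This is only an illustration: the function name `mean_within_between`, the four-scale correlation matrix, and both category assignments are hypothetical and are not taken from the Project A database.

```python
import numpy as np

def mean_within_between(corr, labels):
    """Return (mean within-category r, mean between-category r).

    Each distinct pair of scales is counted once (upper triangle only).
    """
    corr = np.asarray(corr, dtype=float)
    labels = np.asarray(labels)
    i, j = np.triu_indices(len(labels), k=1)
    same_category = labels[i] == labels[j]
    r = corr[i, j]
    return r[same_category].mean(), r[~same_category].mean()

# Hypothetical correlations among four temperament scales.
corr = [
    [1.00, 0.45, 0.10, 0.05],
    [0.45, 1.00, 0.08, 0.12],
    [0.10, 0.08, 1.00, 0.38],
    [0.05, 0.12, 0.38, 1.00],
]

# Two candidate classifications of the same four scales.
good = ["Surgency", "Surgency", "Adjustment", "Adjustment"]
poor = ["Surgency", "Adjustment", "Surgency", "Adjustment"]

within_good, between_good = mean_within_between(corr, good)
within_poor, between_poor = mean_within_between(corr, poor)
print(within_good, between_good)
print(within_poor, between_poor)
```

A reclassification would be kept when it raises the first number and lowers the second, which is the sense in which the "good" assignment above beats the "poor" one.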

Meta analysis of criterion-related validities. Our next step was
to summarize the criterion-related validities according to these con- 
structs; Table 2 of your handout shows the results. It is a meta analy- 
sis of the criterion-related validities of scales within each predictor 
construct for each criterion construct. As you can see, several temper- 
ament constructs correlate with the criteria. Note that there are three 



additional predictor constructs. These three, "Achievement," "Masculin-
ity," and "Locus of Control," were all a part of the miscellaneous
category. When we summarized the validities for the miscellaneous
category, we found respectable validities there too, so we looked more 
closely at the scales included in the miscellaneous category and found 
these additional three constructs. 

The results in this table are different from the results that Guion 
and Gottier obtained. We believe that our strategy of summarizing the 
validities according to both predictor and criterion constructs accounts 
for the difference in results. To test this hypothesis, we summarized 
the validity coefficients in our database without regard to construct 
and obtained a coefficient of essentially zero, quite different from the 
coefficients in Table 2. We believe this demonstrates the importance of 
constructs as organizing principles for examining and understanding the 
literature on the criterion-related validity of temperament variables. 
We used the results in this table to guide us in selecting predictor 
constructs to measure. 
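
A small sketch of the contrast between the two summaries is given below. The coefficients are invented for illustration and are not from our database; they are chosen to show one way a pooled mean can sit near zero (validities of opposite sign cancelling) while the construct-by-construct means do not. The helper `summarize_by_construct` is a hypothetical name.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical validity coefficients:
# (predictor construct, criterion construct, r)
validities = [
    ("Dependability", "Delinquency", -0.30),
    ("Dependability", "Delinquency", -0.20),
    ("Achievement", "Job Proficiency", 0.30),
    ("Achievement", "Job Proficiency", 0.20),
    ("Affiliation", "Training", 0.00),
]

def summarize_by_construct(rows):
    """Mean r for each predictor-construct by criterion-construct cell."""
    cells = defaultdict(list)
    for predictor, criterion, r in rows:
        cells[(predictor, criterion)].append(r)
    return {cell: mean(rs) for cell, rs in cells.items()}

by_construct = summarize_by_construct(validities)
pooled = mean(r for _, _, r in validities)

for cell, r in by_construct.items():
    print(cell, round(r, 2))
print("pooled without regard to construct:", round(pooled, 2))
```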

Development of Temperament Scales

The next step in our research strategy was to develop measures of 
the constructs that the literature review indicated were likely to 
predict criteria important to the Army. List 1 of your handout shows 
the substantive scales we developed for each construct. We developed 
measures for six constructs: Surgency, Adjustment, Agreeableness, 
Dependability, Achievement, and Locus of Control. We also developed a 
"Physical Condition" scale and four response validity scales: Non- 
Random Response, Social Desirability, Poor Impression, and Self- 
Knowledge. We developed the Non-Random Response scale to detect inven-
tories that had been completed carelessly, a "Social Desirability" scale
to detect intentional distortion that might occur in an applicant set- 
ting or a non-draft setting, and a "Poor Impression" scale to detect 
intentional distortion that might occur in a draft setting. We called 
the inventory the ABLE, short for Assessment of Background and Life
Experiences. 

We revised the items and scales in the ABLE many times. People 
representing a variety of perspectives reviewed the items for sensitive 
content. We also pretested the scales three times, each time evaluating 
and revising the items and scales based on soldiers' verbal feedback, 
item response distributions, internal consistency estimates, and test- 
retest reliabilities. The scale statistics for the ABLE scales appear 
in Table 3 of your handout. The average number of items in a scale is 
15. The median alpha of the substantive scales is .81, and the median 
test-retest reliability of the substantive scales is .78. Table 4 sum- 
marizes the ABLE substantive scale statistics as well as correlations of 
the ABLE substantive scales with each other and with other components of 
the four-hour predictor battery. The only part of the predictor battery 
that the ABLE substantive scales correlate with in any sizable way is
other ABLE substantive scales. The ABLE substantive scales appear to be 
tapping a part of the predictor domain not tapped by other measures. 
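
For readers who want to see how an internal consistency estimate of this kind is obtained, here is a minimal sketch of coefficient alpha. The standard formula is used; the item responses are hypothetical (five respondents, three items) and are not ABLE data, and `cronbach_alpha` is simply a name chosen for the example.

```python
import numpy as np

def cronbach_alpha(items):
    """Coefficient alpha for a respondents-by-items score matrix."""
    items = np.asarray(items, dtype=float)
    k = items.shape[1]                           # number of items
    item_variances = items.var(axis=0, ddof=1)   # variance of each item
    total_variance = items.sum(axis=1).var(ddof=1)  # variance of scale totals
    return (k / (k - 1)) * (1 - item_variances.sum() / total_variance)

# Hypothetical responses: rows are respondents, columns are items.
scores = [
    [1, 2, 1],
    [2, 2, 2],
    [3, 3, 2],
    [2, 3, 3],
    [3, 3, 3],
]
alpha = cronbach_alpha(scores)
print(round(alpha, 4))
```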

Demonstration of Job-Relatedness

The next step in our research strategy was to demonstrate the job- 
relatedness of our temperament scales. We conducted a concurrent valid-
ity study during the summer and fall of 1985. Over 9000 soldiers com-
pleted the 4-hour predictor battery that included measures of cognitive



ability, spatial ability, perceptual psychomotor ability, work environ- 
ment preferences, interests, and temperament. 

Criterion-related validities. The criterion measures, the develop-
ment of which was a major part of the research project, were developed
by a different part of the research team. The criterion composites are 
very briefly described in List 2 of your handout. There are five 
composites: Core Technical Proficiency, General Soldiering Proficiency, 
Effort and Leadership, Personal Discipline, and Physical Fitness and 
Military Bearing. The first two consist mainly of work samples and 
knowledge tests. The other three consist of supervisory and peer rat- 
ings and information obtained from personnel records. 

Table 6 of your handout shows the criterion-related validities of 
the ABLE scales for these five criteria. The results suggest that 
Achievement scales are the best predictors of the "Effort and Leader- 
ship" criterion; Dependability scales are the best predictors of the 
"Personal Discipline" criterion; and Physical Condition is the best
predictor of the "Physical Fitness and Military Bearing" criterion, 
though the Achievement scales also correlate with this criterion. These 
three criteria include the supervisory and peer ratings. The other two 
criteria, Core Technical Proficiency and General Soldiering Proficiency,
which consist of work sample and knowledge tests, are not predicted with 
the ABLE substantive scales. 

Table 7 in your handout shows the criterion-related validities of
the different types of predictors included in the study. It shows the
multiple correlations of each type of predictor with each of the five 
criteria. As you can see, the best predictors of the supervisory and 
peer rating criteria, that is, Effort and Leadership, Personal Disci-
pline, and Physical Fitness and Military Bearing, are the ABLE substan-
tive scales. The other conclusion from this table is that the ASVAB
mental ability test and the ABLE temperament inventory are the two best
predictors of the criterion domain. 

Fairness 

We next turned to the issue of fairness. Are the items and scales 
fair for groups protected under the 1964 Civil Rights Act? The mean
scores for whites, blacks, and Hispanics appear in Table 8 of your
handout. As you can see, minorities do not tend to score lower than 
whites on the ABLE scales. Our efforts to write items that were not 
biased against minorities appear to have been successful. We're 
currently conducting differential validity and fairness analyses; those 
analyses, however, are not yet complete.

Examination of Effects of Motivational Set

The fourth component of our research strategy involved investigat- 
ing several issues related to motivational set. A frequent criticism of 
self-report inventories is that respondents can intentionally distort
their responses. When respondents are applicants, this is an especially 
important criticism because the criterion-related validities might be
negatively affected by distorted responses. We therefore studied the 
impact of motivational set on criterion-related validities, the extent 
to which applicants distort their self descriptions, and the usefulness 
of the four response validity scales to detect and adjust for motiva- 
tional set. 

Faking study. First, we conducted an experiment in which soldiers
were instructed to respond honestly or to distort their responses in a 
specified way. The participants in the experiment were 245 enlisted
soldiers at Ft. Bragg. The experiment used a repeated-measures design
with faking and honest conditions counterbalanced. We performed a multivariate
analysis of variance on the ABLE scales and found that soldiers can 
distort their responses when instructed to do so. 

We then examined the extent to which the response validity scales 
detected intentional distortion. Table 9 of your handout shows the 
results. The last two columns show the effect size of the difference 
between honest and fake good and honest and fake bad. Effect size can 
be interpreted in standard deviation terms. Thus, the difference in the 
honest and fake good condition for Social Desirability is essentially 
one standard deviation; the Social Desirability scale detects distortion 
in the fake good condition. As you can see, the Non-Random Response, 
Poor Impression, and Self-Knowledge scales detect distortion in the fake
bad condition. 
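
As a worked illustration of an effect size expressed in standard deviation units, the sketch below divides the difference between two condition means by a pooled standard deviation. The paper does not state which formula was used, so the pooled-SD form is an assumption; the Ns, means, and SDs are the Honest First and Fake Bad First figures as they appear in Table 9, and with them this formula lands close to the 1.85 and -2.67 reported there.

```python
import math

def effect_size(n1, m1, sd1, n2, m2, sd2):
    """Standardized mean difference (group 1 minus group 2), pooled SD.

    Assumption: the pooled-SD form; the source does not name its formula.
    """
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / (n1 + n2 - 2)
    return (m1 - m2) / math.sqrt(pooled_var)

# (N, mean, SD) as read from Table 9 of the handout.
honest = {
    "Self-Knowledge": (109, 29.6, 3.6),
    "Poor Impression": (109, 1.5, 2.1),
}
fake_bad = {
    "Self-Knowledge": (56, 21.8, 5.2),
    "Poor Impression": (56, 14.6, 7.9),
}

d = {scale: effect_size(*honest[scale], *fake_bad[scale]) for scale in honest}
for scale, value in d.items():
    print(scale, round(value, 2))
```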

We next examined the extent to which we could use the response 
validity scales Social Desirability and Poor Impression to adjust ABLE 
substantive scales for faking. Table 10 shows the effect of regressing 
out Social Desirability in the fake good condition and the effect of 
regressing out Poor Impression in the fake bad condition. Median values 
are reported in this table. The .49 in the upper left-hand cell indi- 
cates that the median difference in ABLE scores between the honest and 
fake good condition before regressing out Social Desirability is .49 or 
half a standard deviation. That is, ABLE scale scores differ by about 
half a standard deviation in the fake good condition as compared to the 
honest condition. The next number to the right shows that after regres- 
sing out Social Desirability from the fake good condition, the ABLE 
substantive scales differ from the honest condition by only .14 or just 
over one-tenth of a standard deviation. 




The next two values to the right show the results for the honest 
and fake bad conditions. Clearly, the Social Desirability and Poor 
Impression scales can be used to adjust substantive scale scores for 
intentional distortion. 
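
One way such an adjustment might be carried out is sketched below: fit a least-squares line predicting a substantive scale from a response validity scale, then remove the part of each score that the line attributes to departure from a reference level on the validity scale. The exact adjustment formula is not given in this paper, so the function `regress_out`, the choice of reference value, and the simulated scores are all assumptions made for illustration.

```python
import numpy as np

def regress_out(scale, validity, reference):
    """Adjust scale scores to the level expected at `reference` on the
    validity scale, using a simple least-squares slope (an assumption)."""
    scale = np.asarray(scale, dtype=float)
    validity = np.asarray(validity, dtype=float)
    slope, _intercept = np.polyfit(validity, scale, 1)
    return scale - slope * (validity - reference)

# Simulated "fake good" data: substantive scores rise with Social Desirability.
rng = np.random.default_rng(0)
social_desirability = rng.normal(20.0, 5.0, size=200)
substantive = 30.0 + 0.8 * social_desirability + rng.normal(0.0, 4.0, size=200)

# Hypothetical reference level, e.g. a mean from an honest condition.
adjusted = regress_out(substantive, social_desirability, reference=16.0)

raw_r = np.corrcoef(substantive, social_desirability)[0, 1]
adjusted_r = np.corrcoef(adjusted, social_desirability)[0, 1]
print(round(raw_r, 2), round(abs(adjusted_r), 2))
```

After the adjustment the substantive scores no longer covary with the validity scale, and their mean moves toward what would be expected at the reference level.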

These data demonstrate that: (1) people can distort their res- 
ponses to temperament scales, (2) response validity scales can detect 
such distortion, and (3) the response validity scales can be used to 
adjust temperament scale scores for distortion. 

We then asked, to what extent do applicants distort their res- 
ponses? To answer this question, we compared scale scores of 121 Army 
applicants with scale scores of two groups of soldiers who had no motive
for distorting their responses. Table 11 shows the results. On the 
substantive scales, applicants actually scored lower than one or both
groups of soldiers 9 out of 11 times. These data suggest that appli- 
cants do not appear to distort their responses. 

Nevertheless, we examined the effects of inaccurate self descrip- 
tions, as detected by the response validity scales, on criterion-related 
validities obtained in the concurrent validity study. Table 12 shows 
that validities for the group detected as responding in a random way are 
significantly lower than validities for the group responding conscien- 
tiously. Table 13 shows the increment in validity when Social Desira- 
bility is used as a moderator variable. Table 14 shows the increment in 
validity when Poor Impression is used with each substantive scale in a 
multiple correlation. The data in these three tables indicate that the 
response validity scales do improve, modestly, the validities of the 
substantive scales even in a concurrent validity study where there is 
little motive to distort one's self description. 
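
The idea of an increment in validity from adding a response validity scale to a substantive scale can be illustrated with the standard two-predictor multiple correlation formula. The three correlations below are hypothetical values chosen for the example, not entries from Tables 13 or 14, and `multiple_r` is a name used only here.

```python
import math

def multiple_r(r_y1, r_y2, r_12):
    """Multiple correlation of a criterion with two predictors, given the
    two validities (r_y1, r_y2) and the predictor intercorrelation (r_12)."""
    r_squared = (r_y1 ** 2 + r_y2 ** 2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12 ** 2)
    return math.sqrt(r_squared)

# Hypothetical values: a substantive scale, a response validity scale,
# and the correlation between the two predictors.
r_substantive = 0.20
r_validity_scale = -0.10
r_between = -0.30

combined = multiple_r(r_substantive, r_validity_scale, r_between)
increment = combined - abs(r_substantive)
print(round(combined, 3), round(increment, 3))
```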



Project A researchers are currently conducting a predictive valid-
ity study which will provide an opportunity to evaluate the validities
of the ABLE substantive scales and the usefulness of the response valid- 
ity scales in a selection situation. 

Summary

We overcame objections to the use of temperament variables in 
selection by: 

1. reviewing the literature using a construct-based approach to 
identify useful temperament constructs in previous criterion-
related validity studies;

2. focusing scale development on constructs that are likely to predict 
criteria important to the client; 

3. developing scales that consist of items acceptable to the public; 

4. developing scales that are not biased against minorities; 

5. developing scales that are psychometrically good;

6. developing response validity scales to detect inaccurate self des- 
criptions; 

7. evaluating job-relatedness of scales by demonstrating criterion- 
related validity; 

8. developing and evaluating "adjustments" to substantive scale scores 
based on response validity scale scores; and

9. evaluating the effect of motivational set on scale scores and 
criterion-related validities. 




REFERENCES 

Guion, R. M., & Gottier, R. F. (1966). Validity of personality measures
in personnel selection. Personnel Psychology, 18, 135-164.

Mischel, W. (1968). Personality and assessment. New York: Wiley.






HAND-OUT 



Overcoming Objections to Use of Temperament 
Variables In Selection: Demonstrating 
Their Usefulness 

by Leaetta M. Hough
Personnel Decisions Research Institute 



American Psychological Association Convention
New York, August 1987 

A major part of this research was funded by the Army 
Research Institute, Contract No. MDA-903-82-C-0531. All
statements expressed in this paper are those of the author and do
not necessarily express the official opinions of the U.S. Army 
Research Institute or the Department of the Army. 




RESEARCH STRATEGY: CONSTRUCT ORIENTATION 



1. Review Literature
   o Develop predictor taxonomy
   o Classify temperament scales
   o Develop criterion taxonomy
   o Summarize criterion-related validities according
     to predictor and criterion constructs
   o Identify useful predictor constructs

2. Develop Temperament Scales
   o Examine items for sensitive content
   o Develop response validity scales to detect
     intentional distortion
   o Pretest
   o Examine psychometric characteristics
   o Revise

3. Demonstrate Job-Relatedness
   o Conduct concurrent validity study
   o Compute criterion-related validities
   o Conduct differential validity analyses
   o Conduct fairness analyses
   o Conduct predictive validity study

4. Examine Effects of Motivational Set
   o Evaluate fakability of scales
   o Evaluate response validity scales
   o Evaluate moderator effects of response validity scales
   o Develop "adjustment" formula
   o Assess effects on criterion-related validities



Table 1  Mean Within-Category and Between-Category
Correlations of Temperament Scales

                         Surgency Adjust-  Agree-   Depend-  Intel-   Affili-  Miscel-
                                  ment     ableness ability  lectance ation    laneous

Surgency        Mean r   ?
                SD       ?
                N        ?

Adjustment      Mean r   .20      ?
                SD       ?        .19
                N        321      165

Agreeableness   Mean r   .04      .24      ?
                SD       .17      .16      .14
                N        173      162      ?

Dependability   Mean r   -.08     .13      .06      ?
                SD       .16      .20      .17      ?
                N        286      276      166      121

Intellectance   Mean r   .12      .02      .04      -.12     ?
                SD       .15      .14      .16      .18      .19
                N        175      193      94       162      52

Affiliation     Mean r   .09      .00      .10      .05      ?        ?
                SD       .21      ?        .17      .14      .15      .16
                N        157      150      98       160      84       45

Miscellaneous   Mean r   .09      .12      .02      .02      .04      -.04     .05
                SD       .17      .18      .18      .18      .17      .15      .20
                N        ?        419      215      361      242      208      246

? = entry not legible in this copy. Diagonal entries (circled in the
original) are mean within-category correlations.



Table 2  Criterion-Related Validity Studies
That Used Temperament Predictors



Cri terfon 



Predictor 

2 

Construct 
*Surgency
Affiliation
*Adjustment
*Agreeableness
*Dependability
*Intellectance



Educational 



Number
Predictors r 



42 
5 

44 
9 

24 
6 



.15 
-.04 



26 



.01 



.15 



.18 



Training 



Number mean 
Predictors r 



47 
0 

44 
5 

26 
7 



• 08 



.16 



.10 
.11 
.14 



Job 

Involvement 



Number mean
Predictors r 



21 
4 

21 
4 

18 

8 



.04 
.06 



,13 



.02 



.17 



-.10 



Job 

Proficiency 



Number mean
Predictors r 



175 
16 
146 



.04 
-.01 

ED 



48 -.01 

102 [In] 



32 



.01 



Negative Adjustment



Delinquency



Number mean
Predictors r 



a 

0 
10 

1 

10 

1 



Achievement


8 


s 


4 


.33 




4 






0 • • • 


4 


Masculinity






3 


.09 


10 


.10 


0 • > • 


3 


Locus of Control


1 


.32 


2 


.29 




7 


.25 




0 • • • 


0 



Substance Abuse 



Number mean

Predictors r 



.27 



.02 



30 
4 
31 
8 

25 
2 



.06 
-.03 
-.07 
-.04 

0 



1 Time Period 1960-1984.

2 A star denotes the construct is one of the "Big Five" constructs.
Note: Correlations are not corrected for unreliability or range restriction.



List 1

ABLE Scales Organized According to Construct Intended to Measure

SUBSTANTIVE SCALES:

Surgency
  . Dominance
  . Energy Level

Adjustment
  . Emotional Stability

Agreeableness (Likeability)
  . Cooperativeness

Dependability
  . Nondelinquency
  . Traditional Values
  . Conscientiousness

Achievement
  . Work Orientation
  . Self Esteem

Locus of Control
  . Internal Control

Physical Condition
  . Physical Condition

RESPONSE VALIDITY SCALES:
  . Non-Random Response
  . Social Desirability
  . Poor Impression
  . Self-Knowledge






Table 3  ABLE Scale Statistics for Total Group
(Concurrent Sample; Revised Trial Battery)

                                                          Internal
                               No.                        Reliability  Test-Retest
                               Items   N      Mean  S.D.  (Alpha)      Reliability

ABLE SUBSTANTIVE SCALES
Emotional Stability            17      8522   39.0  5.45  .81          .74
Self-Esteem                    12      8472   28.4  3.70  .74          .78
Cooperativeness                18      8494   41.9  5.28  .81          .76
Conscientiousness              15      8504   35.1  4.31  .72          .74
Nondelinquency                 20      8482   44.2  5.91  .81          .80
Traditional Values             11      8461   26.6  3.72  .69          .74
Work Orientation               19      8498   42.9  6.06  .84          .78
Internal Control               16      8485   38.0  5.11  .78          ?
Energy Level                   21      8488   48.4  5.97  .82          .78
Dominance                      12      8477   27.0  4.28  .80          .79
Physical Condition              6      8500   14.0  3.04  .84          .85

ABLE RESPONSE VALIDITY SCALES
Social Desirability            11      8511   15.5  3.04  .63          .63
Self-Knowledge                 11      8508   25.4  3.33  .65          .64
Non-Random Response             8      9188    7.4  1.19  ?            .30
Poor Impression                23      8492    1.5  1.85  .63          ?

? = entry missing or not legible in this copy.



2 i??ioa^i?^ f^^J°^ "^^f^ and random respondii^. 

3 oS.y for ' Non-Ranaam Response test-retest correlations) 






Table 4  ABLE Substantive Scales: Summary
(Revised Trial Battery)

                                                  Range        Median

Reliability:
  Internal Consistency (Alpha)                    .69 - .84    .81
  Test-Retest                                     .69 - .85    .78

Relationship to Predictor Variables:
  Correlation ABLE Substantive Scales             .00 - .73    .30
  Correlation Interest Scales                     .00 - .43    .09
  Correlation Preferred Work Environment Scales   .00 - .35    .13
  Correlation Perceptual/Psychomotor Measures     .00 - .13    .03
  Correlation Cognitive Measures                  .00 - .20    .05
  ASVAB* Adj. r2                                  .01 - .04    .01

*Mental ability test currently used by military.






List 2 
Criterion Composites^ 



rrS*^^ ■ supervisory and peer ratings of effort and 

f?ecti5eKess°''anJ\??5J^''""5' f '^^tiveness aSJ red? teS'combat 
achieJeleS?"' ^ '"^ certificates of commendation and other 

''TS"^^ "i-^^^^^"^ ■ supervisory and peer ratings of personal control 
"'peJsoJSli'ile"' actioSs and oth^r negSuSe'ldS?] 

Physical Fitness & Military Bearing - a) supervisory and peer ratings of
physical fitness and military bearing; and b) Physical Readiness Tests



Ind^falfSf'igSs! " ^'^'^ ^'"''^ administered, i.e., 



summer 






Table 6  Validities of ABLE Scales for Job Performance Criteria:
Zero-Order Correlations
(Revised Trial Battery; Concurrent Validity Study)

Criterion (columns, in order): Core Technical Proficiency; General
Soldiering Proficiency; Effort & Leadership; Personal Discipline;
Physical Fitness & Military Bearing

Predictor

Surgency:
  • Dominance
Achievement:
  • Self Esteem
  • Work Orientation
  • Energy Level
Adjustment:
  • Emotional Stability
Agreeableness (Likeability)
  • Cooperativeness
Dependability:
  • Traditional Values
  • Non-delinquency
  • Conscientiousness
Others:
  • Internal Control
  • Physical Condition



.01 



.01 



.15 



.02 


.01 


.20 


.02 


.02 


.23 


.02 


.02 


.22 


.0? 


.02 


.17 


.01 


.02 


.15 


.03 


.06 


.13 


.05 


.07 


.12 


.02 


.02 


.18 


•04 


.05 


.13 


•.04. 


-.05 


.09 


.13 


.U 


.07 




-.06 


.02 


*.04 


••05 ..15 


•.04 


-.03 


.07 



.12 



Response Validity Scales:
  • Non-Random Response
  • Social Desirability
  • Poor Impression
  • Self-Knowledge



CorrclatJo,,, are based on u«cr«n«J d.t. for this scale. H varies fro.- 0«4 ,o 
acai e. 



.02 


.18 


.13 


.20 


.IS 


.21 


.14 


.25 



.16 



.H 



.25 


.16 


.2V 


• H 


.23 


.22 


.13 


.13 


.03 


[29] 


.10 


.02 


.05 


.07 


.15 


•.16 


.05 


.13 



9322 for this 



Note: N varies from 7666 to 8477.



Note: 




A box indicates notable predictor/criterion construct relationships. 




Table 7

Multiple Correlations of Independent
Predictor Composites with Each of Five Job
Performance Criteria
(Concurrent Validity Study)



Criterion Composites 



Predictor 
Composites

ASVAB^ 

(mental ability test) 

Spatial Abilities 



Core Technical
Proficiency 



.62 



.56 



General 
Soldiering 



0 



.62 



Effort & 



Proficiency leadership Disctoline Beari 



Physical 
Fitness & 
Personal Military 
ng_ 



.35 



.26 



.20 



.U 



.11 



Perceptual/Psychomotor 

Abilities (computerized)



.5A 



.58 



.30 



.12 



.10 



Work Environment 
Preferences 



.28 



.27 



.20 



.10 



.11 



Temperament (and

physical activities scale) 

Interests 



.26 



.3A 



.2A 



S 0 S 



.26 



.U 



.13 



Multiple Rs are adjusted for shrinkage and corrected for restriction in range, but not corrected for criterion
unreliability.

Mental ability test currently used by military.

Note: Entries in table are averaged across 9 Army military occupational specialties (MOS) with [illegible].
Total sample is 3902. Sample sizes range from 281 to 570; median = 432.

Note: Boxes denote the two best predictors of the criterion space.







Table 8

ABLE Scale Means and Standard Deviations Separately for Race: Trial Battery (Revised)

                              Black                Hispanic               White                  Other
                         (N = 2227 - 2256)     (N = 284 - 292)       (N = 5614 - 5673)       (N = 328 - 332)
                          Mean      SD          Mean         SD       Mean      SD           Mean         SD

ABLE Substantive Scales
Emotional Stability       39.3     4.97         38.7        5.25      38.9     5.63          38.2        5.47
Self-Esteem               28.7     3.32         28.7        3.49      28.4     3.83          27.8        4.02
Cooperativeness           42.6     5.02         41.9        4.92      41.6     5.38          41.6        5.18
Conscientiousness         35.7     3.68         36.1        4.08      34.7     4.53          35.7        3.80
Nondelinquency            45.4     5.18         45.0        5.96      43.7     6.11          44.8        5.93
Traditional Values        27.2     3.11         27.0        3.16      26.3     3.95          26.7        3.42
Work Orientation          43.1     5.51         43.5        5.44      42.8     6.31          43.2        5.80
Internal Control          37.8     4.55         38.2        4.50      38.1     5.37          38.4        4.54
Energy Level              48.6     5.35         49.6        5.49      48.3     6.21          48.2        5.92
Dominance                 27.7     3.86         27.3        4.09      26.8     4.42          26.5        4.15
Physical Condition        14.4     2.84         14.0        3.11      13.8     3.10          13.6        3.09

ABLE Response Validity Scales
Social Desirability       15.8     3.05         [illegible] 3.60      15.2     2.91          [illegible] 3.50
Self-Knowledge            26.2     3.10         25.4        3.12      25.1     3.39          25.5        3.11
Non-Random Response        7.6     0.65          7.6        0.68       7.7     0.54           7.6        0.62
Poor Impression            1.4     1.66          1.4        1.57       1.5     1.94           1.6        1.91

Note: A box indicates a difference from the White mean of approximately one-half standard deviation or more.
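
The half-standard-deviation rule in the note to Table 8 can be checked mechanically. The sketch below is illustrative only: the note does not say which standard deviation the difference is scaled by, so scaling by the White group's SD is an assumption, and the function names are hypothetical rather than anything from Project A.

```python
def standardized_gap(group_mean: float, white_mean: float, white_sd: float) -> float:
    """Difference from the White mean, expressed in White-SD units.

    Assumption: the White SD is the scaling unit; Table 8 does not specify this.
    """
    return (group_mean - white_mean) / white_sd


def would_be_boxed(group_mean: float, white_mean: float, white_sd: float,
                   threshold: float = 0.5) -> bool:
    """True when the absolute gap reaches the half-SD threshold named in the note."""
    return abs(standardized_gap(group_mean, white_mean, white_sd)) >= threshold


# Energy Level, Hispanic vs. White, using the means and SD printed in Table 8.
gap = standardized_gap(49.6, 48.3, 6.21)
print(f"Energy Level gap: {gap:.2f} SD, boxed: {would_be_boxed(49.6, 48.3, 6.21)}")
```

Under this assumption, every legible substantive-scale difference in Table 8 falls well short of half a standard deviation.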






Table 9

ABLE Response Validity Scales:
Effects of Honest* and Faking* Conditions

Ft. Bragg

                                                                                 Effect Size    Effect Size
ABLE Response            Honest First*      Fake Good First*    Fake Bad First*  Honest vs.     Honest vs.
Validity Scale           N     M    S.D.    N     M    S.D.     N     M    S.D.  Fake Good      Fake Bad

Social Desirability      109  15.8  3.1     57   20.1  5.8      56   17.8  4.8   [illegible]      -.53
(Unlikely Virtues)

Self-Knowledge           109  29.6  3.6     57   29.7  4.1      56   21.8  5.2     -.03           1.85

Non-Random Response      109   7.6  1.0     57    7.0  1.8      56    2.8  2.2      .45           3.16

Poor Impression          109   1.5  2.1     57    1.7  2.2      56   14.6  7.9     -.09          -2.67

*Values are based on the sample that completed the questionnaires under the condition of interest
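
The paper does not spell out how the effect sizes in Table 9 were computed. As a hedged sketch, the snippet below assumes a standardized mean difference (Honest minus faking condition) divided by the pooled standard deviation; the function names are hypothetical. With the N, M, and S.D. values printed in Table 9, this assumption comes within about .01 of the legible Honest vs. Fake Bad entries (-.53, 1.85, 3.16, -2.67), which suggests, but does not prove, that a formula of this kind was used.

```python
from math import sqrt


def pooled_sd(n1: int, sd1: float, n2: int, sd2: float) -> float:
    """Pooled standard deviation of two independent groups."""
    return sqrt(((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / (n1 + n2 - 2))


def effect_size(n_h: int, m_h: float, sd_h: float,
                n_f: int, m_f: float, sd_f: float) -> float:
    """Assumed formula: (Honest mean - faking mean) / pooled SD."""
    return (m_h - m_f) / pooled_sd(n_h, sd_h, n_f, sd_f)


# (N, M, S.D.) per scale, copied from Table 9: Honest First and Fake Bad First.
honest = {
    "Social Desirability": (109, 15.8, 3.1),
    "Self-Knowledge": (109, 29.6, 3.6),
    "Non-Random Response": (109, 7.6, 1.0),
    "Poor Impression": (109, 1.5, 2.1),
}
fake_bad = {
    "Social Desirability": (56, 17.8, 4.8),
    "Self-Knowledge": (56, 21.8, 5.2),
    "Non-Random Response": (56, 2.8, 2.2),
    "Poor Impression": (56, 14.6, 7.9),
}

for scale, h in honest.items():
    d = effect_size(*h, *fake_bad[scale])
    print(f"{scale:22s} Honest vs. Fake Bad: {d:6.2f}")
```

Because the table reports means and standard deviations to one decimal place, values recomputed this way can differ from the printed entries in the second decimal.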






TABLE 10

Effects of Regressing Out Response Validity Scales
(Social Desirability and Poor Impression)
in Faking Conditions for ABLE

                               Honest vs. Fake Good              Honest vs. Fake Bad
                                   Effect Size                       Effect Size
                             Before         After              Before         After
                             Adjustment     Adjustment         Adjustment     Adjustment

ABLE Substantive Scales        -.49         [illegible]          2.10           .45






Table 11

Comparison of Ft. Bragg Honest, Ft. Knox, and MEPS (Applicants) ABLE Scales

                              Ft. Bragg           MEPS
                              (Honest)*        (Applicants)       Ft. Knox Total
ABLE Scale                    N     Mean        N     Mean         N     Mean

Response Validity Scales
Social Desirability          116   15.91       121   16.63        276   16.60
Self-Knowledge               116   29.54       121   28.03        276   29.64
Non-Random Response          116    7.58       121    7.79        276    7.75
Poor Impression              116    1.50       121    1.05        276    1.54

Substantive Scales
Emotional Stability          112   66.22       118   66.03        272   65.05
Self-Esteem                  112   34.77       118   34.04        272   35.12
Cooperativeness              112   53.33       118   54.60        272   54.19
Conscientiousness            112   46.37       118   46.49        272   48.97
Non-Delinquency              112   53.24       118   54.36        272   55.49
Traditional Values           112   36.67       118   36.97        272   37.28
Work Orientation             112   59.71       118   58.37        272   61.40
Internal Control             112   49.48       118   51.90        272   50.37
Energy Level                 112   57.56       118   56.67        272   57.19
Dominance                    112   35.54       118   32.84        272   35.41
Physical Condition           112   32.96       118   28.27        272   31.08

*Scores are based on persons who responded to the honest condition first.






Table 12. Moderating Effects of Random Responding on Correlations
Between ABLE Scales and Job Performance Criteria

                                          CRITERION
     Effort/Leadership            Personal Discipline          Physical Fitness/Bearing
     Low        High              Low        High              Low        High
     (Random)   (Non-Random)      (Random)   (Non-Random)      (Random)   (Non-Random)

Surgency:
  Dominance

Achievement:
  Self-Esteem
  Work Orientation
  Energy Level

Adjustment:
  Emotional Stability

Agreeableness:
  Cooperativeness

Dependability:
  Traditional Values
  Nondelinquency
  Conscientiousness

Others:
  Internal Control
  Physical Condition



l-.oo 



TOT 



rro? 



.13 



.00 



-.03 



.15 



.05 



.02 



TIT 



.03 



.09 



TST~I 



I .08 



.18 



.22 I 



.10 



Tl4l 



_azj 



.08 



Tin 



.15 



.17 



.21 I 



.13 



I .03 



ZD 



.09 



-.00 



-.03 



.18 



I .08 



I .20 



.09 



I .10 



.05 



I .16 



N ranges from 659 to 675 for group scoring low on "Non-Random Response" scale.
N ranges from 8336 to 8477 for group scoring high on "Non-Random Response" scale.
Note: A statistically significant difference at p < .05 is approximately .04.

We performed a split group analysis rather than a moderated regression because the
variable of interest had a highly skewed distribution.




.18 



Tin 



TIT"! 



T25n 



.16 I 



3D 



1 .07 


.13 1 


1 .19 


.i5 1 


.18 


.16 


.09 


.12 


1 .22 


.29 1 


.14 


.14 


1 .05 


.18 1 


1 -11 


.23 ) 


[ .16 


.22 1 



no 



T29 I 



Table 13. Moderating Effects of "Social Desirability" Scale on Correlations
Between ABLE Scales and Job Performance Criteria

                          Effort/Leadership          Personal Discipline        Physical Fitness/Bearing
ABLE Scale                Non-High     High          Non-High     High          Non-High     High

Surgency:
  Dominance                 .15         .14          [illegible]   .06            .18         .17

Achievement:
  Self-Esteem             [illegible]   .12            .12       [illegible]    [illegible] [illegible]
  Work Orientation          .25         .20            .17         .16          [illegible] [illegible]
  Energy Level            [illegible]   .20            .13         .15          [illegible] [illegible]

Adjustment:
  Emotional Stability       .17         .16            .11         .12          [illegible] [illegible]

Agreeableness:
  Cooperativeness           .16         .13            .20         .21            .14         .12

Dependability:
  Traditional Values
  Nondelinquency
  Conscientiousness

Others:
  Internal Control          .13         .12            .12         .15            .15       [illegible]
  Physical Condition        .08         .09           -.03        -.02            .28         .29

We performed a split group analysis rather than a moderated regression because the
variable of interest had a highly skewed distribution.

2 N ranges from 5896 to 5997 for group scoring Non-High on "Social Desirability" scale.
N ranges from 2428 to 2480 for group scoring High on "Social Desirability" scale.

A statistically significant difference at p < .05 is approximately .03.




1 .14 


.11 1 


1 .26 


.22 1 


1 .18 


.11-1 


.13 


.12 


.28 


.29 


1 .14 


.11 1 


1 


.14 1 


.22 


.22 


1 .24 


.14 1 



Table 14. Incremental Validities of ABLE Scales When "Poor Impression"
Scale is Included in Predictor Equation
(Linear Model)

Surgency:
  Dominance

Achievement:
  Self-Esteem
  Work Orientation
  Energy Level

Adjustment:
  Emotional Stability

Agreeableness:
  Cooperativeness

Dependability:
  Traditional Values
  Nondelinquency
  Conscientiousness

Others:
  Internal Control
  Physical Condition



Effort/Leadership
r R



.17 



.18 



TIT 



an: 



TTS" 



TTT 



Tin 



CRITERION
Personal Discipline



TTT 



.16 I 



.21 



.25 



.22 



.26 



Physical Fitness/Bearing
r R



TIT 



TIT 



TTT 



TTT 



'7221 



1 .20 


.22 1 


I .12 


.17 1 


1 .20 


.22 1 






•25 1 


1 .18 


.20 1 


1 .21 


.23 1 


.22 


.22 


1 .14 


.17 1 


.25 


.26 



inn 



Tig") . 



.29 
.23 


.29 
.24 


1 

.22 


.IS 1 
.23 


1 .13 


.17 I 


1 .13 


.17 1 




1 .a3 


.16 1 


1 .29 


.31 1 



N = 8400

Note: A statistically significant difference at p < .05 is approximately .02.
[illegible] of the Poor Impression scale




